4 <110> APPLICANT: Petrini, John H.J.

5 Morgan, William Franklin

6 Maser, Richard Scott

7 Carney, James Patrick 

10 <120> TITLE OF INVENTION: DNA Encoding A DNA Repair Protein 
13 <130> FILE REFERENCE: 800.019US1 

15 <140> CURRENT APPLICATION NUMBER: US 09/837,602 
17 <141> CURRENT FILING DATE: 2001-04-18 
19 <150> PRIOR APPLICATION NUMBER: US 09/067,641 
21 <151> PRIOR FILING DATE: 1998-04-27 
23 <160> NUMBER OF SEQ ID NOS: 24

25 <170> SOFTWARE: FastSEQ for Windows Version 3.0 

27 <210> SEQ ID NO: 1 

28 <211> LENGTH: 4403 

29 <212> TYPE: DNA 

30 <213> ORGANISM: Homo sapiens 
32 <400> SEQUENCE: 1 



33 ttcggcacga ggcgcggttg cacgtcggcc ccagccctga ggagccggac cgatgtggaa 60 

34 actgctgccc gccgcgggcc cggcaggagg agaaccatac agacttttga ctggcgttga 120 

35 gtacgttgtt ggaaggaaaa actgtgccat tctaattgaa aatgatcagt cgatcagccg 180 

36 aaatcatgct gtgttaactg ctaacttttc tgtaaccaac ctgagtcaaa cagatgaaat 240 

37 ccctgtattg acattaaaag ataattctaa gtatggtacc tttgttaatg aggaaaaaat 300 

38 gcagaatggc ttttcccgaa ctttgaagtc gggggatggt attacttttg gagtgtttgg 360 

39 aagtaaattc agaatagagt atgagccttt ggttgcatgc tcttcttgtt tagatgtctc 420 

40 tgggaaaact gctttaaatc aagctatatt gcaacttgga ggatttactg taaacaattg 480 

41 gacagraagaa tgcactcacc ttgtcatggt atcagtgaaa gttaccatta aaacaatatg 540 

42 tgcactcatt tgtggacgtc caattgtaaa gccagaatat tttactgaat tcctgaaagc 600 

43 agttcagtcc aagaagcagc ctccacaaat tgaaagtttt tacccacctc ttgatgaacc 660 

44 atctattgga agtaaaaatg ttgatctgtc aggacggcag gaaagaaaac aaatcttcaa 720 
45 agggaaaaca tttatatttt tgaatgccaa acagcataag aaattgagtt ccgcagttgt 780

46 ctttggaggt ggggaagcta ggttgataac agaagagaat gaagaagaac ataatttctt 840 

47 tttggctccg ggaacgtgtg ttgttgatac aggaataaca aactcacaga ccttaattcc 900 

48 tgactgtcag aagaaatgga ttcagtcaat aatggatatg ctccaaaggc aaggtcttag 960 

49 acctattcct gaagcagaaa ttggattggc ggtgattttc atgactacaa agaattactg 1020 

50 tgatcctcag ggccatccca gtacaggatt aaagacaaca actccaggac caagcctttc 1080 

51 acaaggcgtg tcagttgatg aaaaactaat gccaagcgcc ccagtgaaca ctacaacata 1140 

52 cgtagctgac acagaatcag agcaagcaga tacatgggat ttgagtgaaa ggccaaaaga 1200 

53 aatcaaagtc tccaaaatgg aacaaaaatt cagaatgctt tcacaagacg cacccactgt 1260 

54 aaaggagtcc tgcaaaacaa gctctaataa taatagtatg gtatcaaata ctttggctaa 1320 

55 gatgagaatc ccaaactatc agctttcacc aactaaattg ccaagtataa ataaaagtaa 1380 

56 agatagggct tctcagcagc agcagaccaa ctccatcaga aactactttc agccgtctac 1440 

57 caaaaaaagg gaaagggatg aagaaaatca agaaatgtct tcatgcaaat cagcaagaat 1500 

58 agaaacgtct tgttctcttt tagaacaaac acaacctgct acaccctcat tgtggaaaaa 1560 






RAW SEQUENCE LISTING DATE: 02/10/2002 

PATENT APPLICATION: US/09/837,602 TIME: 13:38:33 

Input Set : N:\Crf3\RULE60\09837602.txt 
Output Set: N:\CRF3\02102002\I837602.raw 

59 taaggagcag catctatctg agaatgagcc tgtggacaca aactcagaca ataacttatt 1620 

60 tacagataca gatttaaaat ctattgtgaa aaattctgcc agtaaatctc atgctgcaga 1680 

61 aaagctaaga tcaaataaaa aaagggaaat ggatgatgtg gccatagaag atgaagtatt 1740 

62 ggaacagtta ttcaaggaca caaaaccaga gttagaaatt gatgtgaaag ttcaaaaaca 1800 

63 ggaggaagat gtcaatgtta gaaaaaggcc aaggatggat atagaaacaa atgacacttt 1860 

64 cagtgatgaa gcagtaccag aaagtagcaa aatatctcaa gaaaatgaaa ttgggaagaa 1920 

65 acgtgaactc aaggaagact cactatggtc agctaaagaa atatctaaca atgacaaact 1980 

66 tcaggatgat agtgagatgc ttccaaaaaa gctgttattg actgaattta gatcactggt 2040 

67 gattaaaaac tctacttcca gaaatccgtc tggcataaat gatgattatg gtcaactaaa 2100 

68 aaatttcaag aaattcaaaa aggtcacata tcctggagca ggaaaacttc cacacatcat 2160 

69 tggaggatca gatctaatag ctcatcatgc tcgaaagaat acagaactag aagagtggct 2220 

70 aaggcaggaa atggaggtac aaaatcaaca tgcaaaagaa gagtctcttg ctgatgatct 2280 

71 ttttagatac aatccttatt taaaaaggag aagataactg aggattttaa aaagaagcca 2340 

72 tggaaaaact tcctagtaag catctacttc aggccaacaa ggttatatga atatatagtg 2400 

73 tatagaagcg atttaagtta caatgtttta tggcctaaat ttattaaata aaatgcacaa 2460

74 aactttgatt cttttgtatg taacaattgt ttgtyctgtt ttcaggcttt gtcattgcat 2520 

75 ctttttttca tttttaaatg tgttttgttt attaaatagt taatatagtc acagttcaaa 2580 

76 attctaaatr tacgtaaggt aaaggactaa agtcaccctt ccaccattgt cctagctact 2640 

77 tatttttaaa taatttccta cacaaatgat agcataacat atgcagtgtt ctacaccttg 2700 

78 cttttttact tagtaagatt aaaaattata ggaatatcaa tataatgttt ttaatatttt 2760 

79 ttcttttcca ttatgctgta gtcttaccta aactctggtg atccaaacaa aatggcttca 2820 

80 gtggtgcaga tgtcacctac atgttattct agtactagaa actgaagacc atgtggagac 2880 

81 ttcatcaaac atgggtttag ttttcaccag aatggaaaga cctgtacccc tttttggtgg 2940 

82 tcttactgag ctgggtgggt gtctgttttg agcttattta gagtcctagt tttcctactt 3000 

83 ataaagtaga aatggtgaga ttgttttctt tttctacckt aaagggagat ggtaagaaac 3060 

84 aatgaatgtc ttttttcaaa ctttattgac aagtgatttt caagtctgtg ttcaaaaata 3120 

85 tattcatgta cctgtgatcc agcaagaagg gagttccagt caagagtcac tacaactgat 3180 

86 tagttgttta gagaatgaga aatggaacag tgaggaatgg aggccatatt tccatgactt 3240

87 cccttgtaaa cagaagcaac agaagggaca agaggctggc ctctacatca ctctcacctt 3300 

88 ccaaatcttg tggaagtgca tctacttgcc agaaccaaat taacttactt ccaagttctg 3360 

89 gctgcttgca ggtggaactc cagctgcaag ggagttaggg aaatgaaggt ctttttttaa 3420 

90 aagcttctca gccttcctag ggaacagaaa ttgggtgagc caatctgcaa tttctactac 3480 

91 aggcattgag accagttaga ttattgaaat attatagaga gttatgaaca cttaaattat 3540

92 gatagtggta tgacattgga tagaacatgg gatactttag aagtagaatt gacagggcat 3600 

93 attagttgat gaaatggagt catttgagtc tyttaatagc catgtatcat aattaccaag 3660 

94 tgaagctggt ggaacatatg gtctccattt tacagttaag gaatataatg gacagattaa 3720 

95 tattgttytc tgtcatgccc acaatccctt tctaaggaag actgccctac tatagcagtt 3780 

96 tttatatttg tcaatttatg aatataatga atgaggagtt ctggtacctc ctgtctttac 3840 

97 aaatattggg tgttgtccag tatttttccc tttttaaccm ttcccaattc gggtgtgtag 3900 

98 gtggatgttt ccatttgggt tttaatttgt atatccctga tagctataat tgggtcatag 3960

99 aaattcttta tacattctag atgcaagtct cttgycggat atacgtattg agatattaca 4020 

100 cctagtctgt ggcttgactg ttttctttat gtcttttgat gaatagaagt tttaaatttt 4080 

101 gacaaggtca aatttatttt tttcttttgt ttgatatttt ttctctccaa tttaacccca 4140

102 agatttcaga tattctgctc tattatataa actttatatt tttatatttg tgatctacct 4200 

103 tgaattgata tgtatgttgt gaattatgga tcagggttct ttttttcccc catacaagta 4260

104 tccagtcatt gtaacactgt ttattgaaag aattatcctt tcctcattaa attaccttgc 4320

105 caattagtaa aaaatcaatt aaccatrmar mmmrrrggat ccactagttc tagagcggcc 4380

106 gccaccgcgg tggagctcca gct 4403
108 <210> SEQ ID NO: 2 
109 <211> LENGTH: 754

110 <212> TYPE: PRT 

111 <213> ORGANISM: Homo sapiens 
113 <400> SEQUENCE: 2 
159 355 360 365

160 Ser Glu Gln Ala Asp Thr Trp Asp Leu Ser Glu Arg Pro Lys Glu Ile

161 370 375 380

162 Lys Val Ser Lys Met Glu Gln Lys Phe Arg Met Leu Ser Gln Asp Ala

163 385 390 395 400 

164 Pro Thr Val Lys Glu Ser Cys Lys Thr Ser Ser Asn Asn Asn Ser Met 

165 405 410 415 

166 Val Ser Asn Thr Leu Ala Lys Met Arg Ile Pro Asn Tyr Gln Leu Ser

167 420 425 430

168 Pro Thr Lys Leu Pro Ser Ile Asn Lys Ser Lys Asp Arg Ala Ser Gln

169 435 440 445

170 Gln Gln Gln Thr Asn Ser Ile Arg Asn Tyr Phe Gln Pro Ser Thr Lys

171 450 455 460

172 Lys Arg Glu Arg Asp Glu Glu Asn Gln Glu Met Ser Ser Cys Lys Ser

173 465 470 475 480

174 Ala Arg Ile Glu Thr Ser Cys Ser Leu Leu Glu Gln Thr Gln Pro Ala

175 485 490 495

176 Thr Pro Ser Leu Trp Lys Asn Lys Glu Gln His Leu Ser Glu Asn Glu

177 500 505 510 

178 Pro Val Asp Thr Asn Ser Asp Asn Asn Leu Phe Thr Asp Thr Asp Leu 

179 515 520 525 

180 Lys Ser Ile Val Lys Asn Ser Ala Ser Lys Ser His Ala Ala Glu Lys

181 530 535 540

182 Leu Arg Ser Asn Lys Lys Arg Glu Met Asp Asp Val Ala Ile Glu Asp

183 545 550 555 560

184 Glu Val Leu Glu Gln Leu Phe Lys Asp Thr Lys Pro Glu Leu Glu Ile

185 565 570 575

186 Asp Val Lys Val Gln Lys Gln Glu Glu Asp Val Asn Val Arg Lys Arg

187 580 585 590

188 Pro Arg Met Asp Ile Glu Thr Asn Asp Thr Phe Ser Asp Glu Ala Val

189 595 600 605

190 Pro Glu Ser Ser Lys Ile Ser Gln Glu Asn Glu Ile Gly Lys Lys Arg

191 610 615 620

192 Glu Leu Lys Glu Asp Ser Leu Trp Ser Ala Lys Glu Ile Ser Asn Asn

193 625 630 635 640

194 Asp Lys Leu Gln Asp Asp Ser Glu Met Leu Pro Lys Lys Leu Leu Leu

195 645 650 655

196 Thr Glu Phe Arg Ser Leu Val Ile Lys Asn Ser Thr Ser Arg Asn Pro

197 660 665 670

198 Ser Gly Ile Asn Asp Asp Tyr Gly Gln Leu Lys Asn Phe Lys Lys Phe

199 675 680 685

200 Lys Lys Val Thr Tyr Pro Gly Ala Gly Lys Leu Pro His Ile Ile Gly

201 690 695 700

202 Gly Ser Asp Leu Ile Ala His His Ala Arg Lys Asn Thr Glu Leu Glu

203 705 710 715 720

204 Glu Trp Leu Arg Gln Glu Met Glu Val Gln Asn Gln His Ala Lys Glu

205 725 730 735 

206 Glu Ser Leu Ala Asp Asp Leu Phe Arg Tyr Asn Pro Tyr Leu Lys Arg 

207 740 745 750 
Gln Leu Lys

261 1 5 10

263 <210> SEQ ID NO: 6

264 <211> LENGTH: 680

265 <212> TYPE: PRT




VERIFICATION SUMMARY DATE: 02/10/2002 

PATENT APPLICATION: US/09/837,602 TIME: 13:38:34

Input Set : N:\Crf3\RULE60\09837602.txt 
Output Set: N:\CRF3\02102002\I837602.raw 



L:226 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 






